NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

4DIFF: 3D-Aware Diffusion Model for Third-to-First Viewpoint Translation

Cheng, Feng; Luo, Mi; Wang, Huiyu; Dimakis, Alex; Torresani, Lorenzo; Bertasius, Gedas; Grauman, Kristen (May 2025, ECCV '24 https://openreview.net/forum?id=nReyoIseTD)

Abstract. We present 4Diff, a 3D-aware diffusion model addressing the exo-to-ego viewpoint translation task—generating first-person (egocentric) view images from the corresponding third-person (exocentric) images. Building on the diffusion model’s ability to generate photorealistic images, we propose a transformer-based diffusion model that incorporates geometry priors through two mechanisms: (i) egocentric point cloud rasterization and (ii) 3D-aware rotary cross-attention. Egocentric point cloud rasterization converts the input exocentric image into an egocentric layout, which is subsequently used by a diffusion image transformer. As a component of the diffusion transformer’s denoiser block, the 3D-aware rotary cross-attention further incorporates 3D information and semantic features from the source exocentric view. Our 4Diff achieves state-of-the-art results on the challenging and diverse Ego-Exo4D multiview dataset and exhibits robust generalization to novel environments not encountered during training. Our code, processed data, and pretrained models are publicly available at https://klauscc.github.io/4diff.
more » « less
Free, publicly-accessible full text available May 19, 2026
Infilling Score: A Pretraining Data Detection Algorithm for Large Language Models

Raoof, Negin; Rout, Litu; Daras, Giannis; Sanghavi, Sujay; Caramanis, Constantine; Shakkottai, Sanjay; Dimakis, Alex (March 2025, ICLR 2025)

In pretraining data detection, the goal is to detect whether a given sentence is in the dataset used for training a Large Language Model LLM). Recent methods (such as Min-K % and Min-K%++) reveal that most training corpora are likely contaminated with both sensitive content and evaluation benchmarks, leading to inflated test set performance. These methods sometimes fail to detect samples from the pretraining data, primarily because they depend on statistics composed of causal token likelihoods. We introduce Infilling Score, a new test-statistic based on non-causal token likelihoods. Infilling Score can be computed for autoregressive models without re-training using Bayes rule. A naive application of Bayes rule scales linearly with the vocabulary size. However, we propose a ratio test-statistic whose computation is invariant to vocabulary size. Empirically, our method achieves a significant accuracy gain over state-of-the-art methods including Min-K%, and Min-K%++ on the WikiMIA benchmark across seven models with different parameter sizes. Further, we achieve higher AUC compared to reference-free methods on the challenging MIMIR benchmark. Finally, we create a benchmark dataset consisting of recent data sources published after the release of Llama-3; this benchmark provides a statistical baseline to indicate potential corpora used for Llama-3 training.
more » « less
Free, publicly-accessible full text available March 26, 2026
Consistent Diffusion Meets Tweedie: Training Exact Ambient Diffusion Models with Noisy Data

Daras, Giannis; Dimakis, Alex; Daskalakis, Constantinos (July 2024, Proceedings of the 41st International Conference on Machine Learning (ICML))

Full Text Available
Consistent Diffusion Meets Tweedie: Training Exact Ambient Diffusion Models with Noisy Data

Daras, Giannis; Dimakis, Alex; Daskalakis, Constantinos (July 2024, Proceedings of the 41st International Conference on Machine Learning (ICML))

Full Text Available
Consistent Diffusion Models: Mitigating Sampling Drift by Learning to be Consistent

Daras, Giannis; Dagan, Yuval; Dimakis, Alex; Daskalakis, Constantinos (December 2023, NeurIPS 2023)

Full Text Available
Which questions should I answer? Salience Prediction of Inquisitive Questions

https://doi.org/10.18653/v1/2024.emnlp-main.1114

Wu, Yating; Mangla, Ritika Rajesh; Dimakis, Alex; Durrett, Greg; Li, Junyi Jessy (January 2024, Proceedings of the Conference on Empirical Methods in Natural Language Processing (published by Association for Computational Linguistics))

Full Text Available
Ambient Diffusion: Learning Clean Distributions from Corrupted Data

Daras, Giannis; Shah, Kulin; Dagan, Yuval; Gollakota, Aravind; Dimakis, Alex; Klivans, Adam R (December 2023, NeurIPS 2023)

Full Text Available
Restoration-degradation beyond linear diffusions: A non-asymptotic analysis for ddim-type samplers

Chen, Sitan; Daras, Giannis; Dimakis, Alex (July 2023, ICML 2023)

Full Text Available
Score-Guided Intermediate Level Optimization: Fast Langevin Mixing for Inverse Problems

Daras, Giannis; Dagan, Yuval; Dimakis, Alex; Daskalakis; Constantinos (January 2022, Proceedings of the 39th International Conference on Machine Learning (ICML))

Full Text Available
Instance-Optimal Compressed Sensing via Posterior Sampling

Jalal, Ajil; Karmalkar, Sushrut; Dimakis, Alex; Price, Eric (January 2021, Proceedings of Machine Learning Research)
null (Ed.)
Full Text Available

« Prev Next »

Search for: All records